Semantic Understanding and Commonsense Reasoning in an Adaptive Photo Agent

نویسندگان

  • Xinyu Hugo Liu
  • Arthur C. Smith
چکیده

In a story telling authoring task, an author often wants to set up meaningful connections between different media, such as between a text and photographs. To facilitate this task, it is helpful to have a software agent dynamically adapt the presentation of a media database to the user's authoring activities, and look for opportunities for annotation and retrieval. Expecting the user to manually annotate photos with keywords greatly burdens the user. Furthermore, even when photos are properly annotated, their retrieval is often very brittle because semantic connections between annotations and the story text that are "obvious" to people (e.g. between “bride” and “wedding”) may easily be missed by the computer. ARIA (Annotation and Retrieval Integration Agent) is a software agent that acts as an assistant to a user writing e-mail or Web pages. As the user types a story, it does continuous retrieval and ranking on a photo database. It can use descriptions in the story text to semi-automatically annotate pictures based on how they are used. The focus of this thesis is threefold: Improving ARIA’s automated annotation capabilities through world-aware semantic understanding of the text; making photo retrieval more robust by using a commonsense knowledge base, Open Mind Commonsense, to make semantic connections between the story text and annotations (e.g. connect "bride" and "wedding"); and learning personal commonsense through the text (e.g. “My sister’s name is Mary.”) that can then be used to improve photo retrieval by enabling personalized semantic connections. Thesis Supervisor: Dr. Henry Lieberman Title: Research Scientist, MIT Media Laboratory

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semantic Understanding and Commonsense Reasoning

In a story telling authoring task, an author often wants to set up meaningful connections between different media, such as between a text and photographs. To facilitate this task, it is helpful to have a software agent dynamically adapt the presentation of a media database to the user's authoring activities, and look for opportunities for annotation and retrieval. Expecting the user to manually...

متن کامل

Adaptive Linking between Text and Photos Using Common Sense Reasoning

In a hypermedia authoring task, an author often wants to set up meaningful connections between different media, such as text and photographs. To facilitate this task, it is helpful to have a software agent dynamically adapt the presentation of a media database to the user's authoring activities, and look for opportunities for annotation and retrieval. However, potential connections are often mi...

متن کامل

Visual common-sense for scene understanding using perception, semantic parsing and reasoning

In this paper we explore the use of visual commonsense knowledge and other kinds of knowledge (such as domain knowledge, background knowledge, linguistic knowledge) for scene understanding. In particular, we combine visual processing with techniques from natural language understanding (especially semantic parsing), common-sense reasoning and knowledge representation and reasoning to improve vis...

متن کامل

A Distributed Architectural Strategy towards Ambient Intelligence

This work reveals the benefits obtained from combining commonsense reasoning and multi-agent systems on top of a fully equipped middleware platform. The architecture here proposed is founded on the service composition paradigm, as the comprehensive solution to relieve users from being involved in system decision making. In this regard, the environment and domain understanding is emulated by the...

متن کامل

Understanding Stories with Large-Scale Common Sense

Story understanding systems need to be able to perform commonsense reasoning, specifically regarding characters’ goals and their associated actions. Some efforts have been made to form large-scale commonsense knowledge bases, but integrating that knowledge into story understanding systems remains a challenge. We have implemented the Aspire system, an application of large-scale commonsense knowl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001